Overview
Brought to you by YData
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 11991 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Text | 3 |
| Categorical | 2 |
| DateTime | 1 |
EmployeeNumber is highly overall correlated with origin | High correlation |
origin is highly overall correlated with EmployeeNumber | High correlation |
priceEach is highly overall correlated with sales_amount | High correlation |
quantityOrdered is highly overall correlated with sales_amount | High correlation |
sales_amount is highly overall correlated with priceEach and 1 other fields | High correlation |
status is highly imbalanced (76.3%) | Imbalance |
origin is highly imbalanced (77.8%) | Imbalance |
EmployeeNumber has 428 (3.6%) zeros | Zeros |
Reproduction
| Analysis started | 2025-02-07 11:54:23.485833 |
|---|---|
| Analysis finished | 2025-02-07 11:54:31.954446 |
| Duration | 8.47 seconds |
| Software version | ydata-profiling vv4.12.2 |
| Download configuration | config.json |
Variables
orderNumber
Real number (ℝ)
| Distinct | 326 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10268.808 |
| Minimum | 10100 |
|---|---|
| Maximum | 10425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 10100 |
|---|---|
| 5-th percentile | 10116 |
| Q1 | 10182 |
| median | 10273 |
| Q3 | 10358 |
| 95-th percentile | 10409 |
| Maximum | 10425 |
| Range | 325 |
| Interquartile range (IQR) | 176 |
Descriptive statistics
| Standard deviation | 96.897075 |
|---|---|
| Coefficient of variation (CV) | 0.0094360589 |
| Kurtosis | -1.3122306 |
| Mean | 10268.808 |
| Median Absolute Deviation (MAD) | 87 |
| Skewness | -0.12819144 |
| Sum | 1.2313327 × 108 |
| Variance | 9389.0432 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 10386 | 234 | 2.0% |
| 10350 | 221 | 1.8% |
| 10262 | 208 | 1.7% |
| 10212 | 208 | 1.7% |
| 10358 | 182 | 1.5% |
| 10104 | 169 | 1.4% |
| 10380 | 169 | 1.4% |
| 10383 | 169 | 1.4% |
| 10153 | 169 | 1.4% |
| 10182 | 153 | 1.3% |
| Other values (316) | 10109 |
| Value | Count | Frequency (%) |
| 10100 | 12 | 0.1% |
| 10101 | 16 | 0.1% |
| 10102 | 6 | 0.1% |
| 10103 | 64 | 0.5% |
| 10104 | 169 | |
| 10105 | 60 | 0.5% |
| 10106 | 45 | 0.4% |
| 10107 | 18 | 0.2% |
| 10108 | 42 | 0.4% |
| 10109 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 10425 | 39 | |
| 10424 | 78 | |
| 10423 | 10 | 0.1% |
| 10422 | 4 | < 0.1% |
| 10421 | 18 | 0.2% |
| 10420 | 39 | |
| 10419 | 42 | |
| 10418 | 18 | 0.2% |
| 10417 | 78 | |
| 10416 | 28 | 0.2% |
orderLineNumber
Real number (ℝ)
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4966225 |
| Minimum | 1 |
|---|---|
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 14 |
| Maximum | 18 |
| Range | 17 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.2139568 |
|---|---|
| Coefficient of variation (CV) | 0.6486381 |
| Kurtosis | -0.55744911 |
| Mean | 6.4966225 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.58126728 |
| Sum | 77901 |
| Variance | 17.757432 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1274 | |
| 2 | 1222 | |
| 3 | 1139 | |
| 4 | 1089 | |
| 5 | 1009 | |
| 6 | 942 | |
| 7 | 841 | 7.0% |
| 8 | 803 | 6.7% |
| 9 | 714 | 6.0% |
| 10 | 629 | 5.2% |
| Other values (8) | 2329 |
| Value | Count | Frequency (%) |
| 1 | 1274 | |
| 2 | 1222 | |
| 3 | 1139 | |
| 4 | 1089 | |
| 5 | 1009 | |
| 6 | 942 | |
| 7 | 841 | |
| 8 | 803 | |
| 9 | 714 | |
| 10 | 629 |
| Value | Count | Frequency (%) |
| 18 | 42 | 0.4% |
| 17 | 110 | 0.9% |
| 16 | 188 | 1.6% |
| 15 | 234 | 2.0% |
| 14 | 312 | |
| 13 | 421 | |
| 12 | 459 | |
| 11 | 563 | |
| 10 | 629 | |
| 9 | 714 |
customerNumber
Real number (ℝ)
| Distinct | 98 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 214.96289 |
| Minimum | 103 |
|---|---|
| Maximum | 496 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 103 |
|---|---|
| 5-th percentile | 121 |
| Q1 | 141 |
| median | 145 |
| Q3 | 298 |
| 95-th percentile | 456 |
| Maximum | 496 |
| Range | 393 |
| Interquartile range (IQR) | 157 |
Descriptive statistics
| Standard deviation | 111.08194 |
|---|---|
| Coefficient of variation (CV) | 0.51674937 |
| Kurtosis | -0.15746026 |
| Mean | 214.96289 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 1.0877225 |
| Sum | 2577620 |
| Variance | 12339.197 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 141 | 3367 | |
| 124 | 1620 | 13.5% |
| 114 | 220 | 1.8% |
| 151 | 192 | 1.6% |
| 276 | 184 | 1.5% |
| 323 | 184 | 1.5% |
| 148 | 172 | 1.4% |
| 353 | 164 | 1.4% |
| 119 | 159 | 1.3% |
| 187 | 153 | 1.3% |
| Other values (88) | 5576 |
| Value | Count | Frequency (%) |
| 103 | 21 | 0.2% |
| 112 | 87 | 0.7% |
| 114 | 220 | 1.8% |
| 119 | 159 | 1.3% |
| 121 | 128 | 1.1% |
| 124 | 1620 | |
| 128 | 88 | 0.7% |
| 129 | 63 | 0.5% |
| 131 | 141 | 1.2% |
| 141 | 3367 |
| Value | Count | Frequency (%) |
| 496 | 144 | |
| 495 | 36 | 0.3% |
| 489 | 24 | 0.2% |
| 487 | 30 | 0.3% |
| 486 | 66 | |
| 484 | 30 | 0.3% |
| 475 | 26 | 0.2% |
| 473 | 16 | 0.1% |
| 471 | 46 | 0.4% |
| 462 | 78 |
EmployeeNumber
Real number (ℝ)
High correlation  Zeros 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1313.1542 |
| Minimum | 0 |
|---|---|
| Maximum | 1702 |
| Zeros | 428 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1165 |
| Q1 | 1216 |
| median | 1370 |
| Q3 | 1401 |
| 95-th percentile | 1612 |
| Maximum | 1702 |
| Range | 1702 |
| Interquartile range (IQR) | 185 |
Descriptive statistics
| Standard deviation | 289.29432 |
|---|---|
| Coefficient of variation (CV) | 0.22030491 |
| Kurtosis | 12.382166 |
| Mean | 1313.1542 |
| Median Absolute Deviation (MAD) | 84 |
| Skewness | -3.1706246 |
| Sum | 15746032 |
| Variance | 83691.203 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1370 | 3740 | |
| 1165 | 1971 | |
| 1401 | 726 | 6.1% |
| 1501 | 641 | 5.3% |
| 1611 | 633 | 5.3% |
| 1337 | 572 | 4.8% |
| 1612 | 546 | 4.6% |
| 1504 | 534 | 4.5% |
| 1323 | 529 | 4.4% |
| 1286 | 447 | 3.7% |
| Other values (5) | 1652 |
| Value | Count | Frequency (%) |
| 0 | 428 | 3.6% |
| 1165 | 1971 | |
| 1166 | 262 | 2.2% |
| 1188 | 307 | 2.6% |
| 1216 | 372 | 3.1% |
| 1286 | 447 | 3.7% |
| 1323 | 529 | 4.4% |
| 1337 | 572 | 4.8% |
| 1370 | 3740 | |
| 1401 | 726 | 6.1% |
| Value | Count | Frequency (%) |
| 1702 | 283 | 2.4% |
| 1612 | 546 | 4.6% |
| 1611 | 633 | 5.3% |
| 1504 | 534 | 4.5% |
| 1501 | 641 | 5.3% |
| 1401 | 726 | 6.1% |
| 1370 | 3740 | |
| 1337 | 572 | 4.8% |
| 1323 | 529 | 4.4% |
| 1286 | 447 | 3.7% |
productCode
Text
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.1014928 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S24_3969 |
|---|---|
| 2nd row | S24_3969 |
| 3rd row | S24_3969 |
| 4th row | S18_2248 |
| 5th row | S18_2248 |
| Value | Count | Frequency (%) |
| s18_3232 | 242 | 2.0% |
| s24_2840 | 164 | 1.4% |
| s24_1444 | 159 | 1.3% |
| s32_2509 | 156 | 1.3% |
| s50_1392 | 150 | 1.3% |
| s24_4048 | 147 | 1.2% |
| s18_2238 | 146 | 1.2% |
| s12_4473 | 145 | 1.2% |
| s18_2319 | 143 | 1.2% |
| s32_3207 | 137 | 1.1% |
| Other values (99) | 10402 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 13986 | |
| S | 11991 | |
| _ | 11991 | |
| 1 | 11909 | |
| 4 | 8871 | |
| 8 | 8555 | |
| 3 | 7519 | |
| 0 | 7276 | |
| 7 | 4393 | 4.5% |
| 9 | 3925 | 4.0% |
| Other values (2) | 6729 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 97145 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 13986 | |
| S | 11991 | |
| _ | 11991 | |
| 1 | 11909 | |
| 4 | 8871 | |
| 8 | 8555 | |
| 3 | 7519 | |
| 0 | 7276 | |
| 7 | 4393 | 4.5% |
| 9 | 3925 | 4.0% |
| Other values (2) | 6729 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 97145 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 13986 | |
| S | 11991 | |
| _ | 11991 | |
| 1 | 11909 | |
| 4 | 8871 | |
| 8 | 8555 | |
| 3 | 7519 | |
| 0 | 7276 | |
| 7 | 4393 | 4.5% |
| 9 | 3925 | 4.0% |
| Other values (2) | 6729 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 97145 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 13986 | |
| S | 11991 | |
| _ | 11991 | |
| 1 | 11909 | |
| 4 | 8871 | |
| 8 | 8555 | |
| 3 | 7519 | |
| 0 | 7276 | |
| 7 | 4393 | 4.5% |
| 9 | 3925 | 4.0% |
| Other values (2) | 6729 |
status
Categorical
Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
| Shipped | |
|---|---|
| Cancelled | 357 |
| Resolved | 329 |
| In Process | 188 |
| Disputed | 100 |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.1423568 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Shipped |
|---|---|
| 2nd row | Shipped |
| 3rd row | Shipped |
| 4th row | Shipped |
| 5th row | Shipped |
Common Values
| Value | Count | Frequency (%) |
| Shipped | 10941 | |
| Cancelled | 357 | 3.0% |
| Resolved | 329 | 2.7% |
| In Process | 188 | 1.6% |
| Disputed | 100 | 0.8% |
| On Hold | 76 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| shipped | 10941 | |
| cancelled | 357 | 2.9% |
| resolved | 329 | 2.7% |
| in | 188 | 1.5% |
| process | 188 | 1.5% |
| disputed | 100 | 0.8% |
| on | 76 | 0.6% |
| hold | 76 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 21982 | |
| e | 12601 | |
| d | 11803 | |
| i | 11041 | |
| S | 10941 | |
| h | 10941 | |
| l | 1119 | 1.3% |
| s | 805 | 0.9% |
| n | 621 | 0.7% |
| o | 593 | 0.7% |
| Other values (14) | 3197 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 85644 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| p | 21982 | |
| e | 12601 | |
| d | 11803 | |
| i | 11041 | |
| S | 10941 | |
| h | 10941 | |
| l | 1119 | 1.3% |
| s | 805 | 0.9% |
| n | 621 | 0.7% |
| o | 593 | 0.7% |
| Other values (14) | 3197 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 85644 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| p | 21982 | |
| e | 12601 | |
| d | 11803 | |
| i | 11041 | |
| S | 10941 | |
| h | 10941 | |
| l | 1119 | 1.3% |
| s | 805 | 0.9% |
| n | 621 | 0.7% |
| o | 593 | 0.7% |
| Other values (14) | 3197 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 85644 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| p | 21982 | |
| e | 12601 | |
| d | 11803 | |
| i | 11041 | |
| S | 10941 | |
| h | 10941 | |
| l | 1119 | 1.3% |
| s | 805 | 0.9% |
| n | 621 | 0.7% |
| o | 593 | 0.7% |
| Other values (14) | 3197 | 3.7% |
quantityOrdered
Real number (ℝ)
High correlation 
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.371362 |
| Minimum | 6 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 27 |
| median | 35 |
| Q3 | 43 |
| 95-th percentile | 49 |
| Maximum | 97 |
| Range | 91 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.6569487 |
|---|---|
| Coefficient of variation (CV) | 0.27301603 |
| Kurtosis | 0.39201569 |
| Mean | 35.371362 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.35269445 |
| Sum | 424138 |
| Variance | 93.256658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41 | 488 | 4.1% |
| 34 | 464 | 3.9% |
| 49 | 451 | 3.8% |
| 32 | 448 | 3.7% |
| 44 | 432 | 3.6% |
| 33 | 430 | 3.6% |
| 20 | 421 | 3.5% |
| 31 | 419 | 3.5% |
| 29 | 417 | 3.5% |
| 46 | 416 | 3.5% |
| Other values (51) | 7605 |
| Value | Count | Frequency (%) |
| 6 | 4 | < 0.1% |
| 10 | 7 | 0.1% |
| 11 | 5 | < 0.1% |
| 12 | 3 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 12 | 0.1% |
| 16 | 2 | < 0.1% |
| 18 | 7 | 0.1% |
| 19 | 16 | 0.1% |
| 20 | 421 |
| Value | Count | Frequency (%) |
| 97 | 3 | < 0.1% |
| 90 | 4 | < 0.1% |
| 85 | 2 | < 0.1% |
| 77 | 6 | 0.1% |
| 76 | 5 | < 0.1% |
| 70 | 16 | |
| 66 | 21 | |
| 65 | 6 | 0.1% |
| 64 | 8 | 0.1% |
| 62 | 2 | < 0.1% |
priceEach
Real number (ℝ)
High correlation 
| Distinct | 1572 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90.502459 |
| Minimum | 26.55 |
|---|---|
| Maximum | 214.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 26.55 |
|---|---|
| 5-th percentile | 37.38 |
| Q1 | 60.94 |
| median | 85.87 |
| Q3 | 115.03 |
| 95-th percentile | 157.49 |
| Maximum | 214.3 |
| Range | 187.75 |
| Interquartile range (IQR) | 54.09 |
Descriptive statistics
| Standard deviation | 36.7853 |
|---|---|
| Coefficient of variation (CV) | 0.40645636 |
| Kurtosis | -0.04781738 |
| Mean | 90.502459 |
| Median Absolute Deviation (MAD) | 26.87 |
| Skewness | 0.58830573 |
| Sum | 1085215 |
| Variance | 1353.1583 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 117.48 | 50 | 0.4% |
| 77.24 | 43 | 0.4% |
| 66.99 | 40 | 0.3% |
| 111.57 | 36 | 0.3% |
| 43.27 | 35 | 0.3% |
| 157.49 | 35 | 0.3% |
| 48.8 | 34 | 0.3% |
| 84.33 | 33 | 0.3% |
| 77.05 | 32 | 0.3% |
| 60.3 | 32 | 0.3% |
| Other values (1562) | 11621 |
| Value | Count | Frequency (%) |
| 26.55 | 7 | 0.1% |
| 27.22 | 3 | < 0.1% |
| 27.55 | 2 | < 0.1% |
| 27.88 | 28 | |
| 28.64 | 12 | |
| 28.88 | 7 | 0.1% |
| 29.21 | 2 | < 0.1% |
| 29.35 | 6 | 0.1% |
| 29.54 | 3 | < 0.1% |
| 29.87 | 21 |
| Value | Count | Frequency (%) |
| 214.3 | 15 | |
| 212.16 | 3 | < 0.1% |
| 210.01 | 2 | < 0.1% |
| 207.87 | 3 | < 0.1% |
| 207.8 | 3 | < 0.1% |
| 205.73 | 10 | |
| 205.72 | 7 | |
| 203.64 | 2 | < 0.1% |
| 203.59 | 5 | < 0.1% |
| 201.57 | 16 |
sales_amount
Real number (ℝ)
High correlation 
| Distinct | 2878 |
|---|---|
| Distinct (%) | 24.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3210.0436 |
| Minimum | 481.5 |
|---|---|
| Maximum | 11503.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 481.5 |
|---|---|
| 5-th percentile | 1157.2 |
| Q1 | 1988.2 |
| median | 2866.2 |
| Q3 | 4091.34 |
| 95-th percentile | 6366 |
| Maximum | 11503.14 |
| Range | 11021.64 |
| Interquartile range (IQR) | 2103.14 |
Descriptive statistics
| Standard deviation | 1641.1488 |
|---|---|
| Coefficient of variation (CV) | 0.51125437 |
| Kurtosis | 1.4735427 |
| Mean | 3210.0436 |
| Median Absolute Deviation (MAD) | 1009.32 |
| Skewness | 1.1035407 |
| Sum | 38491632 |
| Variance | 2693369.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4295.04 | 26 | 0.2% |
| 5978.98 | 22 | 0.2% |
| 1818.25 | 22 | 0.2% |
| 3390.2 | 22 | 0.2% |
| 3314.44 | 22 | 0.2% |
| 4011.5 | 18 | 0.2% |
| 3986.4 | 17 | 0.1% |
| 3142.8 | 17 | 0.1% |
| 4529.6 | 17 | 0.1% |
| 2273.92 | 17 | 0.1% |
| Other values (2868) | 11791 |
| Value | Count | Frequency (%) |
| 481.5 | 3 | |
| 529.35 | 3 | |
| 531 | 3 | |
| 546.66 | 1 | < 0.1% |
| 553.52 | 3 | |
| 557.6 | 3 | |
| 577.6 | 3 | |
| 597.4 | 2 | |
| 615 | 4 | |
| 625.5 | 3 |
| Value | Count | Frequency (%) |
| 11503.14 | 2 | < 0.1% |
| 11170.52 | 3 | < 0.1% |
| 10723.6 | 1 | < 0.1% |
| 10460.16 | 4 | < 0.1% |
| 10286.4 | 9 | |
| 10072 | 13 | |
| 9974.4 | 3 | < 0.1% |
| 9712.04 | 3 | < 0.1% |
| 9571.08 | 2 | < 0.1% |
| 9568.73 | 2 | < 0.1% |
origin
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
| spain | |
|---|---|
| japan | 428 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | spain |
|---|---|
| 2nd row | spain |
| 3rd row | spain |
| 4th row | spain |
| 5th row | spain |
Common Values
| Value | Count | Frequency (%) |
| spain | 11563 | |
| japan | 428 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| spain | 11563 | |
| japan | 428 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12419 | |
| p | 11991 | |
| n | 11991 | |
| s | 11563 | |
| i | 11563 | |
| j | 428 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 59955 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 12419 | |
| p | 11991 | |
| n | 11991 | |
| s | 11563 | |
| i | 11563 | |
| j | 428 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 59955 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 12419 | |
| p | 11991 | |
| n | 11991 | |
| s | 11563 | |
| i | 11563 | |
| j | 428 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 59955 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 12419 | |
| p | 11991 | |
| n | 11991 | |
| s | 11563 | |
| i | 11563 | |
| j | 428 | 0.7% |
| Distinct | 2988 |
|---|---|
| Distinct (%) | 24.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.246685 |
| Min length | 7 |
Unique
| Unique | 95 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 10100-1 |
|---|---|
| 2nd row | 10100-1 |
| 3rd row | 10100-1 |
| 4th row | 10100-2 |
| 5th row | 10100-2 |
| Value | Count | Frequency (%) |
| 10153-7 | 13 | 0.1% |
| 10424-4 | 13 | 0.1% |
| 10424-6 | 13 | 0.1% |
| 10205-3 | 13 | 0.1% |
| 10205-5 | 13 | 0.1% |
| 10153-10 | 13 | 0.1% |
| 10279-4 | 13 | 0.1% |
| 10104-6 | 13 | 0.1% |
| 10104-5 | 13 | 0.1% |
| 10104-4 | 13 | 0.1% |
| Other values (2978) | 11861 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 22914 | |
| 0 | 15220 | |
| - | 11991 | |
| 3 | 8350 | 9.6% |
| 2 | 7889 | 9.1% |
| 4 | 4403 | 5.1% |
| 5 | 3769 | 4.3% |
| 8 | 3528 | 4.1% |
| 6 | 3152 | 3.6% |
| 7 | 2997 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 86895 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 22914 | |
| 0 | 15220 | |
| - | 11991 | |
| 3 | 8350 | 9.6% |
| 2 | 7889 | 9.1% |
| 4 | 4403 | 5.1% |
| 5 | 3769 | 4.3% |
| 8 | 3528 | 4.1% |
| 6 | 3152 | 3.6% |
| 7 | 2997 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 86895 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 22914 | |
| 0 | 15220 | |
| - | 11991 | |
| 3 | 8350 | 9.6% |
| 2 | 7889 | 9.1% |
| 4 | 4403 | 5.1% |
| 5 | 3769 | 4.3% |
| 8 | 3528 | 4.1% |
| 6 | 3152 | 3.6% |
| 7 | 2997 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 86895 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 22914 | |
| 0 | 15220 | |
| - | 11991 | |
| 3 | 8350 | 9.6% |
| 2 | 7889 | 9.1% |
| 4 | 4403 | 5.1% |
| 5 | 3769 | 4.3% |
| 8 | 3528 | 4.1% |
| 6 | 3152 | 3.6% |
| 7 | 2997 | 3.4% |
checkNumber
Text
| Distinct | 273 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 7.8369611 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HL575273 |
|---|---|
| 2nd row | IS232033 |
| 3rd row | PN238558 |
| 4th row | HL575273 |
| 5th row | IS232033 |
| Value | Count | Frequency (%) |
| mc46946 | 259 | 2.2% |
| au364101 | 259 | 2.2% |
| jn355280 | 259 | 2.2% |
| je105477 | 259 | 2.2% |
| nu627706 | 259 | 2.2% |
| mf629602 | 259 | 2.2% |
| kt52578 | 259 | 2.2% |
| jn722010 | 259 | 2.2% |
| db583216 | 259 | 2.2% |
| dl460618 | 259 | 2.2% |
| Other values (263) | 9401 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 9103 | 9.7% |
| 2 | 7907 | 8.4% |
| 4 | 7808 | 8.3% |
| 7 | 7595 | 8.1% |
| 1 | 7176 | 7.6% |
| 8 | 6787 | 7.2% |
| 5 | 6505 | 6.9% |
| 3 | 6083 | 6.5% |
| 0 | 5672 | 6.0% |
| 9 | 5078 | 5.4% |
| Other values (21) | 24259 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 93973 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6 | 9103 | 9.7% |
| 2 | 7907 | 8.4% |
| 4 | 7808 | 8.3% |
| 7 | 7595 | 8.1% |
| 1 | 7176 | 7.6% |
| 8 | 6787 | 7.2% |
| 5 | 6505 | 6.9% |
| 3 | 6083 | 6.5% |
| 0 | 5672 | 6.0% |
| 9 | 5078 | 5.4% |
| Other values (21) | 24259 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 93973 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6 | 9103 | 9.7% |
| 2 | 7907 | 8.4% |
| 4 | 7808 | 8.3% |
| 7 | 7595 | 8.1% |
| 1 | 7176 | 7.6% |
| 8 | 6787 | 7.2% |
| 5 | 6505 | 6.9% |
| 3 | 6083 | 6.5% |
| 0 | 5672 | 6.0% |
| 9 | 5078 | 5.4% |
| Other values (21) | 24259 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 93973 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6 | 9103 | 9.7% |
| 2 | 7907 | 8.4% |
| 4 | 7808 | 8.3% |
| 7 | 7595 | 8.1% |
| 1 | 7176 | 7.6% |
| 8 | 6787 | 7.2% |
| 5 | 6505 | 6.9% |
| 3 | 6083 | 6.5% |
| 0 | 5672 | 6.0% |
| 9 | 5078 | 5.4% |
| Other values (21) | 24259 |
paymentDate
Date
| Distinct | 232 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 93.8 KiB |
| Minimum | 2003-01-16 00:00:00 |
|---|---|
| Maximum | 2005-06-09 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
amount
Real number (ℝ)
| Distinct | 273 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43562.704 |
| Minimum | 615.45 |
|---|---|
| Maximum | 120166.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 93.8 KiB |
Quantile statistics
| Minimum | 615.45 |
|---|---|
| 5-th percentile | 6419.84 |
| Q1 | 26155.91 |
| median | 39580.6 |
| Q3 | 52151.81 |
| 95-th percentile | 111654.4 |
| Maximum | 120166.58 |
| Range | 119551.13 |
| Interquartile range (IQR) | 25995.9 |
Descriptive statistics
| Standard deviation | 27278.616 |
|---|---|
| Coefficient of variation (CV) | 0.62619199 |
| Kurtosis | 1.1211273 |
| Mean | 43562.704 |
| Median Absolute Deviation (MAD) | 13424.69 |
| Skewness | 1.0977008 |
| Sum | 5.2236038 × 108 |
| Variance | 7.4412291 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 35420.74 | 259 | 2.2% |
| 49539.37 | 259 | 2.2% |
| 65071.26 | 259 | 2.2% |
| 116208.4 | 259 | 2.2% |
| 59830.55 | 259 | 2.2% |
| 46895.48 | 259 | 2.2% |
| 36140.38 | 259 | 2.2% |
| 36251.03 | 259 | 2.2% |
| 40206.2 | 259 | 2.2% |
| 63843.55 | 259 | 2.2% |
| Other values (263) | 9401 |
| Value | Count | Frequency (%) |
| 615.45 | 32 | |
| 1128.2 | 8 | 0.1% |
| 1491.38 | 32 | |
| 1627.56 | 8 | 0.1% |
| 1676.14 | 7 | 0.1% |
| 1679.92 | 10 | 0.1% |
| 1834.56 | 25 | |
| 1960.8 | 23 | |
| 2434.25 | 34 | |
| 2611.84 | 43 |
| Value | Count | Frequency (%) |
| 120166.58 | 259 | |
| 116208.4 | 259 | |
| 111654.4 | 180 | |
| 105743 | 43 | 0.4% |
| 101244.59 | 180 | |
| 85559.12 | 41 | 0.3% |
| 85410.87 | 180 | |
| 85024.46 | 29 | 0.2% |
| 83598.04 | 180 | |
| 82261.22 | 55 | 0.5% |
Interactions
Correlations
| EmployeeNumber | amount | customerNumber | orderLineNumber | orderNumber | origin | priceEach | quantityOrdered | sales_amount | status | |
|---|---|---|---|---|---|---|---|---|---|---|
| EmployeeNumber | 1.000 | -0.121 | 0.262 | -0.022 | 0.040 | 1.000 | -0.018 | -0.020 | -0.023 | 0.122 |
| amount | -0.121 | 1.000 | -0.332 | 0.087 | 0.077 | 0.188 | -0.004 | 0.019 | 0.002 | 0.072 |
| customerNumber | 0.262 | -0.332 | 1.000 | -0.048 | -0.073 | 0.438 | -0.006 | -0.010 | -0.001 | 0.126 |
| orderLineNumber | -0.022 | 0.087 | -0.048 | 1.000 | -0.039 | 0.019 | 0.018 | -0.033 | -0.004 | 0.071 |
| orderNumber | 0.040 | 0.077 | -0.073 | -0.039 | 1.000 | 0.158 | -0.005 | 0.043 | 0.018 | 0.356 |
| origin | 1.000 | 0.188 | 0.438 | 0.019 | 0.158 | 1.000 | 0.070 | 0.034 | 0.037 | 0.056 |
| priceEach | -0.018 | -0.004 | -0.006 | 0.018 | -0.005 | 0.070 | 1.000 | 0.019 | 0.829 | 0.079 |
| quantityOrdered | -0.020 | 0.019 | -0.010 | -0.033 | 0.043 | 0.034 | 0.019 | 1.000 | 0.546 | 0.193 |
| sales_amount | -0.023 | 0.002 | -0.001 | -0.004 | 0.018 | 0.037 | 0.829 | 0.546 | 1.000 | 0.119 |
| status | 0.122 | 0.072 | 0.126 | 0.071 | 0.356 | 0.056 | 0.079 | 0.193 | 0.119 | 1.000 |
Missing values
Sample
| orderNumber | orderLineNumber | customerNumber | EmployeeNumber | productCode | status | quantityOrdered | priceEach | sales_amount | origin | complete_order_number | checkNumber | paymentDate | amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10100 | 1 | 363 | 1216 | S24_3969 | Shipped | 49 | 35.29 | 1729.21 | spain | 10100-1 | HL575273 | 2004-11-17 | 50799.69 |
| 1 | 10100 | 1 | 363 | 1216 | S24_3969 | Shipped | 49 | 35.29 | 1729.21 | spain | 10100-1 | IS232033 | 2003-01-16 | 10223.83 |
| 2 | 10100 | 1 | 363 | 1216 | S24_3969 | Shipped | 49 | 35.29 | 1729.21 | spain | 10100-1 | PN238558 | 2003-12-05 | 55425.77 |
| 3 | 10100 | 2 | 363 | 1216 | S18_2248 | Shipped | 50 | 55.09 | 2754.50 | spain | 10100-2 | HL575273 | 2004-11-17 | 50799.69 |
| 4 | 10100 | 2 | 363 | 1216 | S18_2248 | Shipped | 50 | 55.09 | 2754.50 | spain | 10100-2 | IS232033 | 2003-01-16 | 10223.83 |
| 5 | 10100 | 2 | 363 | 1216 | S18_2248 | Shipped | 50 | 55.09 | 2754.50 | spain | 10100-2 | PN238558 | 2003-12-05 | 55425.77 |
| 6 | 10100 | 3 | 363 | 1216 | S18_1749 | Shipped | 30 | 136.00 | 4080.00 | spain | 10100-3 | HL575273 | 2004-11-17 | 50799.69 |
| 7 | 10100 | 3 | 363 | 1216 | S18_1749 | Shipped | 30 | 136.00 | 4080.00 | spain | 10100-3 | IS232033 | 2003-01-16 | 10223.83 |
| 8 | 10100 | 3 | 363 | 1216 | S18_1749 | Shipped | 30 | 136.00 | 4080.00 | spain | 10100-3 | PN238558 | 2003-12-05 | 55425.77 |
| 9 | 10100 | 4 | 363 | 1216 | S18_4409 | Shipped | 22 | 75.46 | 1660.12 | spain | 10100-4 | HL575273 | 2004-11-17 | 50799.69 |
| orderNumber | orderLineNumber | customerNumber | EmployeeNumber | productCode | status | quantityOrdered | priceEach | sales_amount | origin | complete_order_number | checkNumber | paymentDate | amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11981 | 10425 | 10 | 119 | 1370 | S18_2432 | In Process | 19 | 48.62 | 923.78 | spain | 10425-10 | NG94694 | 2005-02-22 | 49523.67 |
| 11982 | 10425 | 11 | 119 | 1370 | S32_1268 | In Process | 41 | 83.79 | 3435.39 | spain | 10425-11 | DB933704 | 2004-11-14 | 19501.82 |
| 11983 | 10425 | 11 | 119 | 1370 | S32_1268 | In Process | 41 | 83.79 | 3435.39 | spain | 10425-11 | LN373447 | 2004-08-08 | 47924.19 |
| 11984 | 10425 | 11 | 119 | 1370 | S32_1268 | In Process | 41 | 83.79 | 3435.39 | spain | 10425-11 | NG94694 | 2005-02-22 | 49523.67 |
| 11985 | 10425 | 12 | 119 | 1370 | S10_4962 | In Process | 38 | 131.49 | 4996.62 | spain | 10425-12 | DB933704 | 2004-11-14 | 19501.82 |
| 11986 | 10425 | 12 | 119 | 1370 | S10_4962 | In Process | 38 | 131.49 | 4996.62 | spain | 10425-12 | LN373447 | 2004-08-08 | 47924.19 |
| 11987 | 10425 | 12 | 119 | 1370 | S10_4962 | In Process | 38 | 131.49 | 4996.62 | spain | 10425-12 | NG94694 | 2005-02-22 | 49523.67 |
| 11988 | 10425 | 13 | 119 | 1370 | S18_4600 | In Process | 38 | 107.76 | 4094.88 | spain | 10425-13 | DB933704 | 2004-11-14 | 19501.82 |
| 11989 | 10425 | 13 | 119 | 1370 | S18_4600 | In Process | 38 | 107.76 | 4094.88 | spain | 10425-13 | LN373447 | 2004-08-08 | 47924.19 |
| 11990 | 10425 | 13 | 119 | 1370 | S18_4600 | In Process | 38 | 107.76 | 4094.88 | spain | 10425-13 | NG94694 | 2005-02-22 | 49523.67 |